您现在的位置:首页 > 学术研究 > 论文发表 > A Federated CLIP Fine-Tuning Method Based on Optimal Transport and Dual Prompt Personalization
A Federated CLIP Fine-Tuning Method Based on Optimal Transport and Dual Prompt Personalization
[发布时间:2026-02-27  阅读次数: 27]

作者:Lei Shi, Zepeng Li, Xu Ding, Yingfei Zhu, Xin Yao发表刊物:Electronics

年份:February 2026

摘要:The Contrastive Language-Image Pre-training (CLIP) model uses contrastive learning to align image and text representations, and fine-tuning CLIP with federated learning can extend its application to professional fields. However, federated CLIP fine-tuning faces two key challenges: insufficient alignment of fine-grained semantics between vision and text modalities and poor adaptability to non-independent and identically distributed (non-IID) data. This paper proposes the Optimal Transport Dual Prompt Personalization (OTDPP) framework, injects prompt parameters into the deep networks of both visual and text encoders, achieves fine-grained cross-modal alignment through optimal transport, and designs a dual prompt tuning mechanism. The framework splits prompt parameters into a shared global part aggregated by the server and a private local part reserved by clients, and it enables personalized adaptation without updating large backbone encoders. Extensive experiments show that compared with classic prompt tuning baseline methods, OTDPP reduces computational and communication overhead, retains client-specific personalized features, significantly improves model adaptability and performance, and thus demonstrates broad application prospects.

参考文献拷贝字段:Lei Shi, Zepeng Li, Xu Ding, Yingfei Zhu, Xin Yao. A Federated CLIP Fine-Tuning Method Based on Optimal Transport and Dual Prompt Personalization [J]. Electronics. 2026,15(5), 972. DOI: https://doi.org/10.3390/electronics15050972


相关下载:
    A Federated CLIP Fine-Tuning Method Based on Optimal Transport and Dual Prompt Personalization